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Abstract 

We conjecture the following so-called norm compression inequality for 2 x N parti- 
tioned block matrices and the Schatten p-norms: for p > 2, 



Ai A2 ■■■ An 
Bi B2 ■ ■ ■ Bn 



< 



\A 



2\\p 



\A 



N\\p 



\Bi\\p \\B2\\p ■ ■ ■ ||-BAf||p 



while for 1 < p < 2 the ordering of the inequality is reversed. This inequality 
includes Hanner's inequality for matrices as a special case. We prove several special 
cases of this inequality and give examples for 3x3 and larger partitionings where 
it does not hold. 
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1 Introduction 



The norm compression of a block-partitioned matrix T = w.r.t. a given 

matrix norm ||.|| is a matrix obtained from T by replacing each of its blocks 
by their norm: One can raise the question as to how the norm of 

T relates to the norm of its norm compression. In many cases one can find 
simple upper and lower bounds to one in terms of the other, giving rise to 
norm compression inequalities (NCI's); see [5] and references therein for an 
overview of several of these inequalities. 
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The Schatten p- norms, for 1 < p < oo, are unitarily invariant (UI) norms 
generalising the Ip norms to the non- commutative setting. For a general matrix 
or operator A, they are defined as 

Pllp=(Tr(|An)V^', 

which reduces for positive semi-definite (PSD) matrices A to 

P||, = (Tr(A^))VP. 

If one keeps this definition as is and considers the case < p < 1, one ob- 
tains quasi-norms. The p-norms are convex for p > 1; this is just the triangle 
inequality. For < p < 1, the p-quasi-norms are concave on the positive semi- 
definite (PSD) matrices; this is no longer true when dealing with non-PSD 
matrices, since the absolute value operation then comes into play, which is 
not concave. To prove this convexity/ concavity statement, one can for exam- 
ple apply Theorem 2 in [2] to a matrix A on ® 7i of the form A = Ai(B A2, 
with Ai^2 > 0. 

In this paper, I propose a norm compression inequality, based on the Schatten 
norms, for 2 x N block-partitioned matrices: 



T 



Ai A2--- An 
Bi B2 ■ ■ ■ Bn 



1 denote by Cp{T) its Schatten p-norm compression 

/ 



I conjecture the following: 



^l| Ip 1 1^2! Ip ■ ■ ■ II^Af Hp 
\ l|-^l||p I |-^2||p ■ ■ • II-BatIIp ^ 



Conjecture 1 Let T be a general matrix partitioned in 2 x N blocks, and let 
Cp{T) be its norm compression using the Schatten p-norm, then the following 
norm compression inequalities hold: 

\\T\\p>\\Cp{T)\\^, l<p<2 (1) 
r||,<||Cp(T)||^, p>2. (2) 

The work presented in this paper grew out of on-going work to prove Banner's 
inequality [9] for matrices. Banner's inequality proper [9j for the Lp function 
spaces is, for /, g functions in Lp, 

ii/+^ii^+ii/-^7ii^>(ii/iip+ii^iipr+iii/iip-ii/iipr, (3) 
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for 1 < p <2, and the reversed inequality for 2 < p. It is widely believed that 
these inequalities are also true for the Schatten trace ideals Cpi for general 
matrices A, B in Cp, and 1 < p < 2, 

\\A + B\\l+\\A-B\\l>{\\A\\p+\\B\\,Y+\\\A\\,-\\BU, (4) 



while for 2 < p, the inequality is reversed. This generalisation to matrices has 
been proven in a number of instances: 

(1) For 1 <p, when A + B and A - B aie PSD. 

(2) For 1 < p < 4/3, p = 2, and 4 < p < oo, when A and B are general 
matrices. 

Proofs are due to Ball, Carlen and Lieb [1] and Tomczak-Jaegermann [lOj . 

That Conjectured] implies Banner's inequality for matrices can be seen easily 
by putting N = 2, Ai = B2 = A and A2 = Bi = B. Unitarily conjugating T 

/ 

with the matrix I I , and its norm compression with 




yields 



1 1 
1 -1 



directly 



Two other special cases where Conjectured] is proven are for 2 x 2-partitioned 
block matrices T {N = 2). The case when T is PSD has been proven by King 
[7], and the case when the blocks of T are all diagonal matrices has been 
proven by King and Nathanson [8]. 

Note that, if true, ([2D is strongly sharp, which means that equality can be 
obtained for any given imposed values > and > on | |Afc| |p and | |p, 
respectively. To see this, one just takes A^ = [a^] © and B^ = [bk] © 0. 

In the following, I present equivalent forms of the Conjecture and a number of 
proofs in some further special cases. Section 2 deals with a duality argument 
to show that the 2 < p case can be derived from the 1 < p < 2 case, while the 
converse is, surprisingly, not clear at all. Equivalent forms of the Conjecture 
are given in Section 3. The central part of the paper is Section 4, in which I 
give proofs of the Conjecture for 4 special cases: 

• The norm compression of T has rank 1. 

• All blocks in T have rank 1. 

• All blocks Ak in the first row are proportional, and so are all blocks Bk in 
the second row. 

• Just as in the case of Banner's inequality, p > 4. 

For the first three special cases, it turns out that also holds for < p < 1. 
This, however, is not generally true. 
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Given the fact that Conjecture [T] is a generahsation of Banner's inequahty to 
2 X N block matrices, one may wonder whether the Conjecture could even 
be true for arbitrary M x N partitionings. It turns out that this is not the 
case. For 4x4 block matrices there are counterexamples when the blocks Aij 
are real scalars, in which case the norm-compression is just the entry-wise 
absolute value The following example already appeared in [5j: for 



/ 


3 





-2 


-2\ 







3 


2 


-1 




-2 


2 


4 





V 


-2 


-1 





3y 



one finds \ \A\\^^^ = 9.49929... and 1 1 [| Ail] 1 11.5 = 9.63184.... 

In the final Section of this paper I consider the special case of diagonal blocks, 
and find counterexamples for 3 x 3 block matrices, and counterexamples to 
the extension of ([1]) to < p < 1. 



2 Duality 

A standard duality argument allows to reduce the amount of effort in proving 
Conjectured] by a factor of two. I will show that validity of ([1]) implies validity 
of ([2]). Although seemingly obvious, proving the converse implication turns 
out to be problematic. 



2.1 (QP tmpUes ^ 



Let 1 < p < 2 and let p' be its conjugate exponent 1/p+l/p' = 1; thus p' > 2. 

Using the dual representation of a norm, we can express the left-hand side 
(LHS) of ([2]) in a variational manner as 




max { Re Tr 





< 1 



where the maximisation is over all real scalars si, 

||r||p/ = Tr[TF*], 



sn and ti 



^AT. Thus, 



where Y 



Y 



(s... 



is the optimal Y in the above maximisation, with 



By the assumed vahdity of ([T]), we have 

/lis II 
I Pil Ip • • 

Vll^illp 

The RHS of ([2]) can be expressed as 



< 1. 



RHSfl 



\ ll-Dfellp' • • • 



P' 



N 



:max< ^Sfc||Afc||p/ +tk\\Bk\\p' 





















[h...) 


p ) 



where the maximisation is over all real scalars Si, . . . , sjv and ti, . . . , t^v- Like- 
wise, we have 



\Ak\\p' = max{ReTT[AkGl] : \ \Gk\\p = 1} 
\Bk\\p' = max{ReTi[BkHl] : \ \Hk\\p = 1}. 



Thus we obtain 



TV 

RHS(12])= max Re ^ Tr[A,G*] + Tr[5fci7*], 
under the aforementioned constraints. 

Now note that if we put Sk = \ \Sk\\p, tk = \ \fk\\p, Gk = Sk/sk and Hk = fk/tk, 
we obtain Sk,tk,Gk, Hk satisfying the constraints of the above maximisation. 
Furthermore, with these values we get the identity 

llTlIp, = TT[TY*] = J2TT[AkSl + Bkfl] 

k 

= Y,Ti[AkSkGl + BktkHl] 

k 

= Y,SklAAkGl]+tk1i[BkHll 

k 
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which is real for this particular choice. We conclude that this expression is 
upper bounded by RHS([2]), which is the statement of ([2]). □ 



2.2 Does ^ imply ^? 



One might think that it would be equally easy to prove the converse duality, 
i.e. that validity of ([2]) implies validity of ([T]). This turns out to be not so easy, 
because of an unexpected extra difficulty, and we currently have no proof of 
any converse duality. 

Let us again take p between 1 and 2. Using the dual representation of a norm, 
we again express the LHS of ([1]) in a variational manner as 




max { Re Tr 



N 



p 




(::::) 









■maxlReJ2TT[A,S: + B,T:]: 

Sk,n I 




< 1 

















P' ) 



where the maximisation is over all blocks 5*1, ... , Sn and Ti, . . . , T/v of the 
same size as Ak and Bk- 

Now consider RHS([T]), which can similarly be expressed as 





1 l^fcl Ip ■ ■ ■ ^ 








= max < 




V -Bfc p ■ ■ ■ j 


p 1 



TV 



X! ■Sfcll^fcllp + tk\\B, 



k\\p 



k=l 







> 






= 1 > 




h ] 




[h...) 


p' 



where the maximisation is over all real scalars Si, . . . ,sn- and ti, . . . , t^r. Let 
us denote the optimal values by Sk,tk and introduce the matrix 



X 



Its entries by no means need to be positive, which will turn out to be a 
significant complication. 



Likewise, we have 
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||Afc||p = max{ReTr[AfcGfc] : \ \Gk\\p' = 1} 
\\Bk\\p = max{ReTT[BkHl] : \\Hk\\p' = 1}. 

We denote the optimal Gk and by Gk,Hk. 
Inserting these expressions in the previous one gives 

RHSO = Tr[^fc hGl + 5, 4^:]- 

k 

Comparing this expression for RHS([T]) to the one for LHS([T]), we see that 
LHS > RHS follows if on putting 5*^ = SkGk and Tk = tkHk we obtain a 
matrix 

'hGl 



Y :-- 



hH, . . . 



that satisfies the constraint || H^, < 1. 



If the optimal Sk,tk happen to be non-negative, this is indeed the case. Com- 
bining the constraints on Sk,tk,Gk, Hk with the assumption that Conjecture 
[1] holds for p', we find 



Y\\p>< 





si K^i p' • • • \ 










1^ 4 p' ■ ■ ■ j 


p' 




[h...) 



p' 



Alas, this no longer works if some Sk,ik are negative, because then it can 
happen that 1 1 V | |p/ is greater than 1 (the author is grateful to Chris King for 
pointing this out). 



Take, for example, p = 1.5 {p' = 3) and the 2x2 matrix T with scalar blocks 



/ 



T 



1 2 



V 



-0.1 1 
optimal X is X 



. Its norm compression is A := Cp(T) 
0.488 0.774' 




and the 



-0.153 0.488 
Gi = G2 = H2 = 1 and Hi = —1. This leads to 



(which has 3-norm equal to 1), while 



Y 




0.488 0.774 
+0.153 0.488 
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which has 3-norm 1.0426, i.e. larger than 1. 

Nevertheless, RHS = \\A\\p = Tt[AX*] = Tt[TY*] < Tt[TX*], because of the 
sign patterns, and Tr[rX*] < || T = LHS, because X has 3-norm equal to 
1. In this example, the conjecture itself is not violated, but the implication 
(E]) =^ dl]) does not go through. 



3 Equivalent and Related Forms 



Let us consider first the case p > 2. We put p = 2q, with q > 1, and perform 



the conversion ||T||2q 
both sides: for g > 1, 



|TT*||y2. Inequality then becomes, after squaring 



Efe ^k^l J2k ^kBl 

\ J2k BkAl J2k BkBl 



< 



Hk (^kbk 



J2k (^kbk J2k bl 



(5) 



with ttk = \\Ak\\2g = WAkAlW]/^ and h = H^fclbg = ||5fcS^||i/2. Note that 
both matrices appearing in are PSD. 

In the following I will use the subscript {jk) to indicate the j, k-th block, as 
opposed to the subscript jk without the brackets, which I use to denote the 
j, k-th entry. 

An equivalent formulation of ([5]) is the following 

Conjecture 2 Let Q^*^ = [<5(*fc)]^fc=i arbitrary PSD 2x2 block matrices, 
and ql^^ arbitrary non-negative numbers. The maximum attainable value of 
1 1 Si Q*-*"* I Ig under the constraints \ \Qf^i.\\\q = q^^ is given by 



max \ 

Q(')>0 



HQ 



'{kk)\\<l 



Etq2 



(0 



(6) 



Proof of equivalence of ^ and It is immediately clear that 

/ 

([5]), as can be seen by setting ;= 



implies 



AA* A,B* 
BiA* B,B* 



aj and ^2 



hi 
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To prove the converse, I first show that maximahty of || I]jQ^*''llq under the 
above-mentioned constraints imphes that all Q^'^^ are of the form 



Q 



AA* AB* 
BiA* BiB* 



(7) 



W.l.o.g. we can assume all blocks having the same size (if not, the smaller 
blocks can be padded with zeroes, an operation which does not change the 
Schatten norms). We can maximise 1 1 Yl,i Q^^^ \ \q in two steps. We first maximise 
it keeping the diagonal blocks QI^"*;.) fixed, and then maximise over the diagonal 
blocks while keeping their norms fixed. 

In the first step, positivity of Q(^) requires Q[?2) = (Q(?l))^^^^^*^<5(22))^^^ 
where K^^^ is a contraction. Maximising || Y^iQ^^^lq over all allowed Q^^^ then 
amounts to maximising it over all contractions K^'^\ The set of square con- 
tractions is convex and its extremal points are the unitaries. Since ||.||q is a 
convex function of its argument, || Y^iQ^^^Wq wiH achieve its maximum in an 
extremal point, i.e. in a point where Q(i2) ~ ('5(ii))^^^^*'*''(Q(22))"'^^^; f*^^ some 
unitary U^'^\ Furthermore, this remains so when performing the final optimi- 
sation over the diagonal blocks. This means that the optimal Q*^*^ are indeed 
of the required form ([7D, with Ai = (Q^^^Y^'^, and Bi = U'^'^Qf^^y/'^. 

Applying Q then immediately yields ([6]), since with these assignments we 
have = af, and ^2*'' = ^ 



A (potentially) more general statement than ^ is the following 

Conjecture 3 Let Q = [Q(jk)] and R = [R{jk)] be arbitrary PSD block matri- 
ces, andqk,rk arbitrary non-negative numbers. The maximum attainable value 
of \\Q + R\\q under the constraints \ \Q{kk)\\q = Qk and \ \R{kk)\\q = ^k is given 
by 

max { HQ + R\\q: \ \Q(kk)\\q = qk,\\Rikk)\\q = rk} 

_ [ YkQk J2ky/Wk 
\Eky/Wk Ek^k 




It is immediately clear that ([8]) implies ([5]), as can be seen by setting Q := 
R := [B*Bk], Qk = al and = 6^. When N = 2, the statement of 
Conjecture [3] coincides with Conjecture [2] and is therefore equivalent with it. 
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For larger we don't know whether equivalence holds between Conjectures 
□ andU 



We can further reformulate Conjecture [3] once we realise that it is true for 
matrices Q and R consisting of scalar blocks, which is proven in Section 14.21 
Namely, Conjecture [3] is thus equivalent to the statement that the maximum 
in the LHS of ([8]) is obtained when the blocks of Q and R are essentially 
scalars (i.e. all blocks are proportional to the same matrix). 



In an entirely similar way, we can treat the case 1 < p < 2, which corresponds 
tol/2<g<l. Actually, the equivalences hold in the more general case 
< 5' < 1, although the statements themselves are no longer valid for g < 1/2. 
In the above, we just have to literally replace 'maximisation' by 'minimisation', 
'convex' by 'concave', and reverse the inequalities. 



4 Proofs in Special Cases 



The norm compression has rank 1. 



Let Ak and Bk be such that a| = a^pk and 6| = 13'^Pk, with pfc > and J2kPk = 
1. This is indeed what is meant with T having a rank 1 norm compression. 
The choice of pk is such that they form a probability distribution. By setting 
A-k = s/PkXk and Bk 
reduces to 



^Yfc, with \\XkXl\\q = and \ \YkY^\ 



YkX*k 




< 




for q > 1, and reversed for < g < 1. Now note that the RHS is independent of 
Pk- By convexity of the Schatten norms for q > 1, and concavity of the Schatten 
quasi- norms for < g < 1, it is therefore enough to prove this inequality for 
the extremal points {pk)k = (0, • • • , 0, 1, 0, . . . , 0), which amounts to the case 
that = 1. That is, we need to prove 

^XiX* XiY{ 







i al3\ 


) 


< 




<1 





As the left-hand side is equal to || X^Xi + Y*Yi \ \^, and the right-hand side is 
equal to a"^ + P"^ = \ \XlXi\\q + \ \Y{Yi\\q, this simply follows from the triangle 
inequality; that is, again from convexity of the Schatten norms for g > 1. 
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The reversed inequality for < g < 1 follows similarly from concavity of the 
Schatten g-quasi-norms on PSD matrices. □ 



4-2 All Ak and Bk have rank 1. 



In this case, Ak and Bk can be written as 

Ak = akU*k, Bk = I3kvl, 

with ttfc, [3k, Uk and Vk vectors, such that ||Q;fc|| = cifc, \ \(3k\ \ = bk, 
\\vk\ \ = 1. Then | |^a:| |p = CLk and | |p = bk for all p > 0. 

The left-hand side of (E]) then can be written as 

/ 



\Uk\ 



1 and 



LHS® 



Efcafcttfc Y.kiu*kVk)akf3l 
\Y.kKuk)Pkal EkPkPl 



Introducing K = ®kU*kVk, and denoting by A and B the matrices whose 
columns are ak and Pk, respectively, this is equal to 



LHS® 



AA* ARB* 
BK*A* BB* 



The g-th power of the LHS equals the trace of the g-th power of the given 
block matrix, which is PSD. Therefore, for g > 1, the g-th power of the LHS 
is convex in K and K* , while it is concave in it for < g < 1. Now note that 
is a diagonal contraction. Since the diagonal contractions form a convex set 
with extremal points the diagonal unitaries, the LHS is extremal for K equal 
to some diagonal unitary U . Therefore, for g > 1, introducing the matrices 
Q = A* A and R = UB*BU*, 



LHSflSD < 



AA* AUB* 
BU*A* BB* 



A* A + UB*BU* 



Q + R\ 



whereas, for < g < 1, the direction of the inequality is reversed. Note that 
the diagonal entries of Q and R are given by Qkk = iA*A)kk and, since U is 
a diagonal unitary, Rkk = {B*B)kk- 

The scalars and bk appearing in the right-hand side of ([5]) are thus given 

a/Q^, and bk = \\Pk\\ = 



by ak = \\ak\\ = ^J{A*A)kk = \fQhk, and bk = \\(3k\\ = ^{B*B)kk = VRkk- 
Hence, the right-hand side of ([5]) depends only on the diagonal elements of Q 
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and R. Defining := Qkk and := Rkk, it is given by 



RHS© 




with y := J2k y/Ok^k- In other words, the special case under consideration 
follows from the special case of Conjecture [3] where the blocks of Q and R are 
scalar, and which I will now prove. 

In fact, it holds for any unitarily invariant (quasi)norm, and not just for the 
Schatten norms. More precisely. 



+ R\\\< 



TiQ y 
y Tt R 



(9) 



Furthermore, for < g < 1, the reversed inequality holds for the Schatten 
quasi- norms. To prove this, I only have to show ([9]) for the Ky Fan norms 1 1 . 1 1 (fc) , 
k = 1,2, . . . ,d, because I can then invoke Ky Fan's Dominance Theorem. This 
Theorem is formulated for norms, but can easily be generalised to concave 
quasi-norms. If a Hermitian matrix A dominates another Hermitian matrix B 
in the sense that its eigenvalues majorise those of the other, \{A) >- X{B), 
then A will be dominated by B in any unitarily invariant concave quasi- 
norm. In particular, it will be dominated in any Schatten g-quasi-norm with 
< g < 1. This follows directly from the statement that x -< y implies 
J2'j=i(p{xj) > Z)j=i for concave functions (p (see, e.g. Theorem II. 3.1 in 
[1] mutatis mutandis for concave (p). 

Thus we only have to prove (Q for the Ky Fan norms. We need only consider 
two cases. The case > 1 is trivial, as the right-hand side is then nothing but 
the trace of the matrix. This trace is Tt{Q + R), which is obviously an upper 
bound on \ \Q + -R| |(fc)- 

The remaining case k = 1, i.e. the operator norm, or g = oo, is only slightly 
more complicated. I will prove for the operator norm by showing that the 
right-hand side is the maximum of | |Q+-R| |oo over all Q, i? > with prescribed 
diagonals Qkk = Qk, Rkk = fk- In fact, the following Lemma is even a little 
more general: 

Lemma 1 Let qk, Vk, Sk, ■ ■ ■ , with k = 1, . . . , d, be given non-negative num- 
bers. The maximal value o/||(5 + -R + S' + ...||oo over all d x d PSD matrices 
Q, R, S,. . . with prescribed diagonal elements qk, rk, Sk, ■ ■ ■ , respectively, is 
obtained when Q, R, S,...are rank 1 matrices with non-negative elements. 
That is, Qij = y/qiqj, etc. 
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Proof. Since Q, R, S,. . . are Hermitian, the norm ||Q + i? + S' + ...||oois given 
by max^ ip* {Q + R + S + . . .)ip = max^ ip*Q4' + ip*R'>P + • • •, where ip is any 
normahsed vector. Now, for any vector ip we have the inequahty 

k j<k 

<^\M^qk + 2E IV'illV'feiy^, 

k j<k 

where \Qjk\ < y/QjQk is required for positivity of Q. Letting ipj = exp{i(j)j)\iljj\, 
equahty is obtained when Qj^ = exp^i^cpj — (l)k))-y/QjQk- Since the maximal 
value of ip*Qip for this Q does not depend on the arguments (pj, we could as 
well restrict ip to positive real vectors. As the above is true for any value of 
ip, \\Q + R + S + . . .\\oo is maximal for Qjk = ^/qJOk, Rjk = y/rfn, ■■■ ^ 

With this Lemma, we are done. □ 

Remark: The Lemma cannot be generalised to other norms, because the ex- 
tremal points of the set of positive matrices with prescribed diagonal entries 
do not necessarily have rank 1, even in the simplest case that these diagonal 
entries are all 1 (the case of so-called correlation matrices 

4-3 All Ak are proportional to each other, and so are all 

Let us now consider the case where the Aj. satisfy A^ = a^X, for some scalars 
ak and some matrix X, and similarly, = PkY . For this special case we need 
a result by King from [6] : 

Proposition 1 (King) For Aj., B^ > 0, and any q > 1, 



Y^Ak^Bk 


< 


T.Ak 


max LB,- , 


(10) 


k 


1 


k 







while for < q < 1, 



J2Ak0Bk 


> 


E^^ 


min LB, . 


(11) 


k 




k 


<? 





Proof. Since the A^ are positive, the following notations are meaningful: 

F = (i/a^ ® 11 . . . ® 11) 
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it 



Let us denote by X(fcfc) the k-th diagonal block of a matrix in the same parti- 
tioning as H. For example, H(^kk) = 11 CS) Bk. 

Using these notations, J2k ® -Bfc can be written as FHF*. The Araki-Lieb- 
Thirring inequality (pLj, Theorem IX. 2. 10) can be easily brought in a form 
that states that, ior H > and general F, Tr[(Fi7F*)«] < Tr[(F*F)« iY^] 
holds for q > 1, and the reversed inequality for < g < 1. Thus, for q > 1, 



k 



= Tr[{FHF*y] 

<Ti[{F*Fy H'^] 

= 5:Tr[[(F*Fn(,,)(ll®5D] 

k 

= j:n[{G*G)%,)] Tr[Bl] 

k 

<maxTr[5|] J] Tr[[(G*G)'?](fc,) 
^ k 

= maxTr[m] Tr[(G*G)'']. 
j 



while for < g < 1, the direction of the inequalities reverse, and the 'max' 
has to be replaced by a 'min'. Then noting 

Tr[(G*G)'^] = TiliGGy] = Tr[(5: A^)"], 



and taking g-th roots yields the Proposition. □ 
This proposition can be reformulated as: 
Corollary 1 For Ak, Bk > 0, and any q > 1, 



Y^Ak^Bk 

k 



< 



y^^Ak\\Bk 



(12) 



while for < q < 1 the inequality reverses. 

Proof. Define A'f^ = \\Bk\\gAk and B'f, = Bk/\\Bk\\q. Then H-B^Hg = 1 and, for 
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k 



Y.A'k®B', 



k 



< max 



k 



The proof is completely similar for < g < 1. □ 
What we need now is: 

Corollary 2 For general matrices X^, k = 1,. . . ,d, with \ \XkX^\\q = 1 and 
Z = [zij]fj^i > 0, the block matrix Q := [zijXiX*] satisfies \ \Q\\q < \\Z\\g for 
q>l and \\Q\\q > \\Z\\g forO<q<l. 

Proof. We can write 

Q=i®^Xi) (Z®11) {®iX*), 

thus Q is unitarily equivalent with 

{Z'/^ ® 11) {(BiXiX*) {Z'/^ ® 11) 

and has the same g-norm. Denoting the columns of Z^/^ by i/i, the latter 
matrix can be written as 

Y^y,y:®x,x:. 

i 

Thus, WQWq = II J^iyiUi ® ^i^iWq^ wMch by the previous Corollary is upper 
bounded by \ yiViW^i^iWq \ \q, ior q > 1. By the assumption | \XiX- 
this upper bound reduces to \ Villi 



t \ \q 



i \ \q 



For < g < 1, the proof is completely similar; replace 'upper bound' by 'lower 
bound'. □ 



Now put Ajfc = flfc-^) Bk = bkY, with X and Y such that | |XX*| |q 
1. With this choice the left-hand side of becomes 



\YY* 



{Ekal)XX* {Ekakbk)XY* 
{Ekakbk)YX* {Ekbl)YY* 



and the validity of ([5]) in this case follows directly from Corollary [21 with 



J2k (^k ^k O-kbk 
J2k (^kbk J2k b\ 



> 0. 
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4-4 The case q > 2 



For this case, which corresponds to p > 4, I start with a simple Lemma about 
2x2 PSD matrices; the positivity is essential. 



Lemma 2 Let Aj 



Ci hi 



be 2 X 2 PSD matrices with non-negative el- 



ements. For any UI norm |||.|||, if ai < a2, hi < 62 o-nd Ci < C2, then 

\\\M\\ < 

Proof. By Ky Fan's dominance theorem [3JT], one only needs to show the 
statement for the trace norm and operator norm. Since Ai and A2 are PSD, 
their trace norm equals their trace, and obviously Tt Ai < Ti A2. For the 
operator norm, note that 



l^il 



max 'ijj*Ai'ijj = ailijjif + 6i|'?/'2|^ + 2ci Re^jJl^jJ2■ 



Since ci > 0, in order for the maximum to be achieved, Re?/'i?/'2 must be 
positive. In that case one sees that il)*Aiil) < tp*A2ip, so the same must hold 
for the respective maximums. □ 

From King's norm compression inequality for PSD 2x2 block matrices men- 
tioned in the Introduction (i.e. ([2]) with N = 2 and T > 0), it follows that, for 

g>2. 



\ Y.k BkAl 'Zk BkBl 



< 



1 1 J2k AkAlWq 

\ I I J2k BkAlWg 



By the triangle inequality, and the Cauchy-Schwarz inequality for matrices 
([Ij 1X.30), the matrix elements of the right-hand side are upper bounded as 
follows: 



\\Y.AkAl\\g<J2\\AkAl\\g = J2al 

k k k 

\\j:BkBl\\,<j:hl 

k k 

k k k k 

Application of Lemma [2] then immediately proves ([5]) for q >2. □ 
Remarks. 
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(1) For 1 < q < 2 one might be tempted to apply the NCI proved in |5] 
instead of King's, but, unfortunately, it gives a bound which can be higher 
than ©. 

(2) Note also that, since we don't have the required duality result, this case 
does not allow us to treat the 1 < p < 4/3 case. The only possibility to do 
so would be to use a variant of King's inequality for q < 1, which we don't 
have. Preliminary numerical experiments indicate that for < g < 1, and 
T a 2 X 2 PSD block matrix the following holds: 

l|r||,<||C,(T)||^, 

just as in the q >2 case, which is in the wrong direction. 

(3) Similarly, the NCI proved in [S] also seems to work for < g < 1, with 
the same ordering as for q > 2. 



5 Diagonal blocks, the < p < 1 case, and a counterexample for 
3x3 block matrices 



In this final Section, we study the special case where all blocks are diagonal. 
For N = 2, this case has been considered and proven in [8]. The proof proceeds 
via reducing the problem to the following convexity result: 

Lemma 3 Define the (homogeneous) real-valued function 



B ^ g^{B) 



on M2,2(I^^); where B°^^p denotes the entry-wise (Hadamard) power of B. 
Then gp{B) is convex for 1 < p < 2, while it is concave for 2 < p. 

Numerical experiments quickly reveal that for < p < 1, gp{B) is neither 
convex nor concave. 



With this Lemma at hand, it is straightforward to prove ([2]) for diagonal 
blocks. The question thus arises whether the Lemma remains true for matrices 
in M2,Ar(M^), or even for general non- negative matrices. 

Proof that ^ for diagonal blocks is implied by Lemma\^ We present the proof 
in the most general setting of matrices of arbitrary size, although we will show 
below that the Lemma does not hold for matrices of size 3x3 (and larger). 

Consider the case p > 2. 

Let the block matrix under consideration be denoted T = [T(^ij)] and let a^j 
denote the k-th diagonal entry of T(^ij)- With these diagonal entries we can 
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construct d matrices By a simple unitary conjugation, T can thus be 

brought into the direct sum form 0fc=i[afj]ij, so that 



17^1 IP 
r \\p 



E 



Every matrix in this sum can be trivially norm compressed by applying the 
absolute value to every matrix entry. For this norm compression, the NCI ([2]) 
applies, because this is a special instance of the case where all blocks are rank 
1. For p >2, this gives 



\\t\\i<y: 

Since all matrix entries are now non- negative. Lemma [3] can be applied, giving 











P 




||T||P < 

1 r Mp — 




. k 




P 


[11%) Hp] 



One observes that the right-hand side is just the right-hand side of (12]) • 

For 1 < p < 2, we proceed in a completely similar way. □ 

Upon closer inspection, we see that validity of Conjecture [T] in the case of 
positive diagonal blocks is equivalent to validity of Lemma [31 Hence, coun- 
terexamples to the conjectured validity of Lemma in the 2 x N case yield 
counterexamples to Conjecture [H 

For < p < 1, we already noted that gp{B) is neither convex nor concave, 
whence ([1]) is not true for < p < 1. This is exhibited by T consisting 
of positive 2x2 diagonal blocks, constructed from two matrices in M2(M+) 
that violate convexity of gp{B) (i.e. from A and B such that gp{A + B) 
9{A) + g{B)). 

Proving the Lemma for 2 x N matrices turns out to be surprisingly hard. The 
proof of the 2x2 case in [8J is already very involved and no simple method 
seems to be forthcoming yet. In the remainder of this Section we introduce 
a certain direction along which a proof may eventually be found; using this 
method we do find a counterexample for Conjecture [T] in the 3x3 case. 



Let us focus attention to the function gp{B) defined above and to the case 
1 < p < 2. Following [8] we express the convexity using a differential. By 
homogeneity of gp, its convexity is equal to its subadditivity; i.e. we have to 
show that for B,A e M(M+), gp{B + A) < gp{B) + gp{A). Replacing A by 
tA, t > 0, this yields 

gpiB + tA) <gp{B)+tgpiA). 
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For infinitesimal t we get the requirement 

gp{B + tA) <gp{A). 



d_ 

di 



This is actually an equivalent statement to the former one, as can be seen by 
integrating over t. 

The derivative can be calculated explicitly. For that purpose we need to know 

Tt\B + tA\P. For positive B 



the Frechet derivative of a Schatten norm | 



and A, the absolute value is not needed and we simply get 

Tr(5 + tAY = pTi{BP-^A). 



d_ 

dt 



In the general case one can show that the {p — l)-th power of B has to be 
replaced by the quantity {BB*Y^'^~^B. In terms of the polar decomposition 
B = U\B\, this quantity is equal to \B*\p^^U, which is why I propose to call 
this quantity the {p— l)-th polar power of B. I will use the shorthand S''"' for 
the r-th polar power: 

:= (BB*f-^^I^B. 

Some obvious statements one can make are that the polar power coincides with 
the ordinary power on PSD matrices, and that UI norms don't distinguish 
between polar powers and ordinary powers: | WB^"^^ \\ \ = \ \\ IB]'' \\\. 

With the derivative of the Schatten norm at hand, the Frechet derivative of 
gp{B) can be calculated, and the result of a somewhat lengthy calculation is 



d_ 

dt 



9p 



{B + tA) = Tr A^((5°^/P)[P-i] o B°^^-P^/P). 



t=Q 



Recall that o denotes the Hadamard product, and B°^ denotes the r-th entry- 
wise power. With the substitutions A := A°^/^ and C := 5°^/^, the statement 
we have to prove becomes 



Tr(A°P)^(C[p-il o C°^-P) < Tr \A\p 



(13) 



for 1 < p < 2. Note that equality holds when A and C are proportional. 



I have not been able to prove inequality 0131] for A,C & M2^Ni^^)- However, 
by studying what happens when C is on the boundary of M(]R~^) (that is, 
when some entries of C are 0) I have been able to find a counterexample for 
M3(M+), which, as mentioned above, automatically yields a counterexample 
to Conjectured] for 3x3 block matrices. 
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Consider p = 1.5, say, and the matrix 



e 1 e 



C 



1 e 1 



V 



e 1 1 



J 



where e will tend to 0. One can easily calculate the polar power of C numeri- 
cally. We will only need entry 1, 1 to see the violation of the inequality here. 
The result is 



where the only thing that matters here is that this entry is strictly positive. 
Indeed, the 1, 1 entry of C°^~^ is given by e~°'^, which tends to +oo as e tends 
to 0, and, therefore, the 1, 1 entry of {C^^^^^ o C°^^^) tends to +oo as well. 
Hence, for any A with non-zero 1, 1 entry, the inequality ([T51) is violated to 
the maximally possible extent. 



In the light of this counterexample, one may get concerned about validity 
of ([13]) in the 2 X well. I end this Section by showing that the 

phenomenon just mentioned, of certain entries of {C^^^^ o C°^~^) tending to 
+00, will not occur in 2 x matrices (but leaving open the general question 
of validity of (fT3|) ). Indeed, we have the following Lemma: 

Lemma 4 Let C be a bounded matrix in M2,Ar(M+), 1 < p < 2. // entry 
Cij = 0, then the corresponding entry {C^'''~^^)ij is non-positive. 

It is, of course, this non-positivity that rescues inequality fll3l) here. If entry 
Cij is zero, entry iC°^'P)ij tends to +oo, so that {C^P'^^ oC°^-P)ij is either or 
— oo. In the former case, it does not contribute to the LHS of (fT3|) . while in the 
latter it is a dominating contribution (at least, for A with non-zero entries) 
and causes (ITSl) to be satisfied irrespective of the value of any other entry. 



Proof. By definition, C^p~^^ = \C^\p-'^C. Here, p - 2 < 0. Now, |C^| is a PSD 
2x2 matrix with non-negative entries. 



This is also true for any positive power of |C^|. Indeed, a useful parametrisa- 
tion of PSD 2x2 matrices with non-negative entries is A = XP + /u(ll — P), 
where A > ;U are the eigenvalues of A, and the projector P is given as 



lim(C[°-^l)i,i = 0.11669..., 



P 




with < t < 1 and s 



^t{l - t) e [0,1/2]. Obviously, 
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for any p, = X^P + /i^ (11 - P). Written out, 

\ s{Xp-fiP) (1 -t)Ap + t/iPy ' 

which is clearly entry-wise non-negative for all p > 0. For negative p, on the 
other hand, we see that {Ap)i2 reverses sign (or stays 0). 

Since p — 2 is indeed negative, we thus find that {\C'^\p~^)i2 is non-positive. 
Consider then the zero entry Cij = (the reasoning is identical for row 2). 
Then 

which is non-positive. □ 
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